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A PROCESS TO STUDY CHANGES 
IN GENE^ EXgRESSIOrjLI N T LYMPHOCYTES 

Inventors: Yatindra Prashar and Sherman Weissman 

This application is based on provisional application 60/084,329, filed May 5, 
5 1998, which is incorporated herein by reference in its entirety. This application is 
related to applications serial No. 08/510,032 and serial No. 08/688,514, both of which 
are herein incorporated by reference in their entirety. This application is also related 
to provisional applications serial No. 60/056,844 (Atty. Dkt. No. 044574-5003) and 
serial No. 60/056,861 (Atty. Dkt. No. 044574-5014) which are herein incorporated by 
10 reference in their entirety. 

Technical Field 

This invention relates to compositions and methods that are useful to identify 
agents that modulate the response of T lymphocytes (T cells) to a variety of foreign 
antigens and superantigens, and to modulate the role of T lymphocytes in immune 
15 deficiency diseases, cancer, tissue transplantation and immune disorders. The 
invention also relates to compositions and methods of identifying agents that 
modulate the differentiation of prothymocytes into specific T lymphocyte 
subpopulations. 

20 Background of the Invention 

The immune system is organized to solve the problems of rapid and specific 
recognition of an enormous number of potential antigens by cell-to-cell collaboration 
and by clonal expansion of cells specific for a given antigen. The immune response 
depends on T lymphocytes acting at various steps. One significant cell-to-cell 
25 collaboration in which T lymphocytes take part is the interaction between antigen 

presenting cells such as macrophages. Another significant cell-to-cell collaboration is 
between T lymphocytes (helper T lymphocytes) and antibody producing B cells. In 
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both of these collaborations, the ability of T lymphocytes to respond in an antigen- 
specific manner is central to the strategy of the immune system. 

Besides recognizing foreign protein antigens, T lymphocytes also retain a 
functional memory of virtually all self-proteins which enables then to distinguish 

5 between self and nonself antigens. T lymphocytes are born in the bone marrow and 
mature in the thymus where they learn tolerance to self antigens. It is this ability to 
recognize self from nonself antigens that allow T lymphocytes to recognize and 
respond to potential invaders. 

T lymphocytes are divided into subpopulations on the basis of function and 

10 phenotypic markers. Different T lymphocyte subpopulations function to help in 

antibody formation (T helper cells, T H ), to kill target cells (T-cytotoxic cells, T CTL ), to 
induce inflammation (T-delayed hypersensitivity cells, T DTH ), and to inhibit immune 
responses (T-suppressor cells, T s ). T helper cells are further divided into two major 
subpopulations: T H1 and T H2 . T H1 cells function primarily as helper cells for induction 

15 of B-cell proliferation and differentiation to IgG-producing plasma cells; whereas T H2 
cells produce factors (e.g., interleukins) that induce B cells to differentiate and 
produce IgE and IgA. T H1 cells may further differentiate into T DTH cells that are 
responsible for the inflammatory effects of T lymphocytes (e.g., delayed 
hypersensitivity, cytotoxicity) and secrete inflammatory mediators, such as 

20 lymphotoxin and interferon-y (IFN-y). cell products may also stimulate 
differentiation of other white blood cells such as eosinophils and basophils. 

T lymphocytes respond to antigen by cell activation and proliferation. This 
activation and proliferation coincides with a release of many effector molecules (e.g., 
interleukins) that activate or deactivate other lymphocytes, contribute to immune- 

25 mediated inflammation (lymphokines), or interact with other cell types. For instance, 
interleukins produced by T H cells are required to induce activation and differentiation 
of B cells. At least 20 different interleukins have now been identified and are 
numbered from interleukin-1 (IL-1) to interleukin-20 (IL-20). T H1 cells produce IL-2, 
IFN-y and tumor necrosis factor in response to antigen stimulation, whereas cells 

30 synthesize IL-4, IL-5 and IL-6. Activation and differentiation of T CTL cells result in 
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the appearance of cytoplasmic granules (secretory lysosomes). The T CTL granules 
contain perforin, granzymes, and proteoglycans that act to lyse or kill other cells. 
Therefore, the T lymphocyte population not only contains a variety of effector cells, 
but also is the master regulator of the immune system; the T lymphocyte is the 

5 director of the immunological orchestra of cells and proteins. T lymphocytes function 
to turn off or on other cells in the immune system via T suppressor cells, T helper 
cells and T contrasuppressor cells. Stewart Sell, Immunology, 
Immunopathology & Immunity 30-33 (1996). T lymphocytes regulate (directly or 
indirectly) virtually all aspects of host immune resistance to infection, which include 

10 macrophage activity, antibody synthesis, synthesis of other inflammatory proteins, 
generation of inflammatory cells in the bone marrow and their recruitment into the 
circulation, and differentiation of other effector and regulatory lymphocytes. 

The release of effector molecules and other changes in T lymphocytes during 
activation are preceded by changes in the expression levels of many genes. The 

15 regulation of expression levels of numerous T lymphocyte genes has been an area of 
extensive study (see Ullman et al., (1990) Ann. Rev, Immunol., 8: 421-452 and Kelly 
et al, (1995) Curr. Opin. Immunol. 7: 327-332). The present inventor has utilized the 
analysis of differential gene expression on a more global level, as opposed to the 
analysis of the expression levels of individual RNA species, to provide simultaneous, 

20 near-quantitative information about the levels of gene expression for the multitude of 
genes whose expression levels are modulated during T lymphocyte activation. 

Summary of the Invention 

While the role(s) of T lymphocytes and T lymphocyte subpopulations in 
cancer, infectious disease, and autoimmune and immunodeficiency disorders have 
25 been subject of intense study, the techniques of differential gene expression have not 
been exploited to identify therapeutic or prophylactic agents that modulate the 
response of T lymphocytes in these various roles. The present invention provides an 
approach to systematically assess the transcriptional response from lymphocytes 
activated through contact with a pathogen or from T lymphocytes isolated from a 
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subject with an infectious diseases, immune disorder, GVHD or neoplasm involving T 
lymphocytes. 

One preferred embodiment relates to a method to identify a therapeutic or 
prophylactic agent that modulates the response of a T lymphocyte population to an 

5 antigen. This method comprises the steps of (1) preparing a first gene expression 
profile of a quiescent T lymphocyte population; (2) preparing a second gene 
expression profile of a T lymphocyte population exposed to the antigen; (3) treating 
the exposed T lymphocyte population with the agent; (4) preparing a third gene 
expression profile of the treated T lymphocyte population; (5) comparing the first, 

10 second and third gene expression profiles; and (6) identifying agents that modulate the 
response of a lymphocyte population to the antigen. 

The invention also relates to a method to identify a therapeutic agent that 
modulates a T lymphocyte population found in a subject having a sterile inflammatory 
disease, autoimmune disorder, immunodeficiency disease, cancer or graft-versus-host 

15 disease (GVHD). This method comprises the steps of: (1) preparing a first gene 
expression profile of a T lymphocyte population in a subject having the sterile 
inflammatory disease, autoimmune disorder, immunodeficiency disease, cancer, or 
GVHD; (2) treating the T lymphocyte population with the agent; (3) preparing a 
second gene expression profile of the treated T lymphocyte population; (4) comparing 

20 the first and second gene expression profiles with a gene expression profile of a 
normal T lymphocyte population; and (5) identifying an agent that modulates a T 
lymphocyte population found in a subject having a sterile inflammatory disease, 
autoimmune disorder, immunodeficiency disease, cancer, or GVHD. 

In yet another aspect, the present invention relates to a method of diagnosing a 

25 sterile inflammatory disease, autoimmune disorder, immunodeficiency disease, 
cancer, or GVHD in a subject. The method comprises the steps of: (1) preparing a 
first gene expression profile of a T lymphocyte population from the subject; (2) 
comparing the first gene expression profile to at least one second gene expression 
profile from a T lymphocyte population from a subject having a sterile inflammatory 

30 disease, autoimmune disorder, immunodeficiency disease, cancer, or GVHD and to a 
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third gene expression profile of a normal T lymphocyte population; and (3) 
determining if the subject has a sterile inflammatory disease, autoimmune disorder, 
immunodeficiency disease, cancer, or GVHD. 

Another aspect of the present invention relates to a method of identifying 
5 therapeutic or prophylactic compounds that modulate a T lymphocyte population that 
arises from a genetic defect or mutation. This method comprises the steps of: (1) 
preparing a first gene expression profile of a T lymphocyte population that arises from 
the genetic defect; (2) treating the first T lymphocyte population with the agent; (3) 
preparing a second gene expression profile of the treated T lymphocyte population; 
10 (4) comparing the first and second gene expression profiles with a gene expression 
y profile of a normal T lymphocyte population; and (5) identifying agents that modulate 

CP a T lymphocyte population that arises from a genetic defect. 

jE In contemplated variations of the foregoing methods, relevant data from, e.g., 

irni 

^ the first or second (or third) gene expression profile (e.g., as shown by the positions 

15 and intensities of bands on a gel or the quantity, intensity and relative elution position 
M from a column) may be compared with corresponding relevant data with data that has 

L~ been stored in an electronic or computer-readable format. For example, a data base of 

jt the gene expression profile of quiescent T cells may be used to compare the gene 

M expression profile of T cells that have been activated by exposure to any particular 

20 antigen. Similarly, a gene expression profile of a T cell population exposed to an 
agent that is being evaluated as a candidate agent to modulates the response of a T 
lymphocyte population to an antigen can be compared to a data base of the gene 
expression characteristics of a gene expression profile of a T lymphocyte population 
exposed to that antigen. 
25 Thus, the present invention also relates to a method to identify a therapeutic or 

prophylactic agent that modulates the response of a T lymphocyte population to an 
antigen, comprising the steps of: (1) preparing a first gene expression profile of a T 
lymphocyte population exposed to the antigen; (2) treating the exposed T lymphocyte 
population with the agent; (3) preparing a third gene expression profile of the treated 
30 T lymphocyte population; (4) comparing the first and second gene expression profiles 
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with a data base containing the corresponding gene expression profile information 
from a quiescent T lymphocyte population not contacted with the antigen; and (6) 
identifying agents that modulate the response of a lymphocyte population to the 
antigen. 

5 The present invention also relates to a method to identify a therapeutic or 

prophylactic agent that modulates the response of a T lymphocyte population to an 
antigen, comprising the steps of: (1) preparing a first gene expression profile from a T 
lymphocyte population that has been exposed to the antigen and with the candidate 
therapeutic or prophylactic agent; and (2) comparing the first gene expression profile 

10 with a data base containing the corresponding gene expression profile information 
from a quiescent T lymphocyte population not contacted with the antigen and, as 
appropriate, also comparing the first gene expression profile with the corresponding 
gene expression profile information from a T lymphocyte population that was 
contacted with the antigen but not contacted with the candidate therapeutic or 

15 prophylactic agent; and (3) identifying agents that modulate the response of a 
lymphocyte population to the antigen. 

In a similar fashion, it is contemplated that the gene expression profile 
information from the "first" gene expression profiles in the methods summarized 
above, Le. 9 from quiescent or from diseased but untreated T lymphocyte populations, 

20 can reside in a data base to which the gene expression profile information of the 
"second" gene expression profile may be compared. 

Such data bases also reflect another aspect of the present invention and these 
data bases may contain data based on one or more separately prepared profiles and 
further may reflect averaged or normalized or otherwise manipulated information. 

25 When comparisons are made using data that reflects the average of separately 

prepared profiles, an average prepared from two separate profiles in preferred, more 
preferably from three or four such profiles, and most preferably from five or more 
such profiles. One skilled in the art will know how to prepare and manipulate the 
information in such data bases in order to maximize the practical value of the data 

30 contained therein. 
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In an article of manufacture aspect, the present invention relates to a grouping 
of nucleic acids affixed to a solid support, said nucleic acids preferably represent the 
genes or fragments of genes (or corresponding cDNAs or RNAs) whose expression 
levels are modulated in a T lymphocyte population that arises, e.g., from exposure to 
5 an antigen; are modulated in a T lymphocyte population found in a subject having a 
sterile inflammatory disease, autoimmune disorder, immunodeficiency disease, 
cancer, or GVHD; or are modulated in a T lymphocyte population arising from a 
genetic defect. 

In yet another aspect, the present invention relates to an isolated nucleic acid 
10 molecule comprising the structure: R-X-R\ wherein X is the novel DNA sequence (or 
corresponding RNA sequence) or novel portions thereof as identified in SEQ ID NOS. 
16, 22, 24, 25, 31, 33 or 34; and wherein R and R' are sequences contiguous with X in 
nucleic acid fragments which specifically hybridize with X. 



Brief Description of the Drawings 

15 Figure 1 presents autoradiograms of the expression profiles generated from 

cDNAs made with RNA isolated from quiescent Jurkat cells treated with either 12-0- 
tetradecanoylphorbol-13-acetate (TP A) and phytohemaggiutinin (PHA) or ionomycin 



and TP A. 
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(A) All possible 12 anchoring oligo d(T 18 )nl, n2 were used to generate a 
complete expression profile for the restriction enzyme Bgl II, Expression profiles were 
also generated, and yielded analogous results, using all possible 12 anchoring oligo 
d(T 18 )nl, n2 for each of the following restriction enzymes: Xball, Spel, Ncol, Hind III 
BamHIand Xbal. (B) Figure IB is an autoradiogram of dilutions using RP 8,6 
wherein the cDNAs have been digested with Hind III. 

Figure 2 is a three dimensional and generalized graphical representation of 
gene expression in (a) quiescent T lymphocytes; (b) gene expression in activated T 
lymphocytes; and (c) the differences in gene expression between quiescent and 
activated T lymphocytes. 

Figure 3 represents quantitative differences of specific cDNA bands from 
display gels of quiescent versus activated T lymphocytes (Jurkat cells). Clone TA1 
(SEQ ID NO.l) JkAl (SEQ ID NO.2) TA1 (SEQ ID NO.l) JkA2 (SEQ ID NO.3), 
JkA3 (SEQ ID NO.4) JkA4 (SEQ ID NO.5) JkA5 (SEQ ID NO.6) JkA6 (SEQ ID 
NO.7) JkA7 (SEQ ID NO.8) JkA8 (SEQ ID NO.9) JkA9 (SEQ ID NO. 10) JkAlO 
(SEQ ID NO.l 1) JkAl 1 (SEQ ID NO.12) JkRl (SEQIDNO.13) 
In each panel, the left lane is a cDNA band from untreated T lymphocytes and the 
right lane is the corresponding cDNA band from activated T lymphocytes. In panel 7 
(not panel 7(a)), peripheral blood T lymphocyte RNA was used for RT-PCR. 

Figure 4 is a table of the cDNA bands of Figure 3 that correspond to mRNA 
species that are differentially expressed in the quiescent Jurkat cells and in activated 
Jurkat cells. 
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Figure 5 presents summary data for mRNAs that are differentially expressed in 
activated versus quiescent T cells and the identity of cDNAs corresponding to said 
mRNAs. Although not represented in the figure, the inventors have identified yet 
another sequence (SEQ ID N0.34) that corresponds to a differentially expressed 
5 mRNA. 

Modes of Carrying Out the Invention 

General Description 

The response of T lymphocytes to antigens, superantigens, allografts and 
xenografts, T lymphocyte function in autoimmune and immunodeficiency disorders, 

10 as well as T lymphocyte neoplasms, is a subject of primary importance in view of the 
need to find ways to modulate the immune response. Similarly, the response of T 
lymphocytes to agonists (pro-inflammatory molecules), such as IL-2, is a subject of 
great importance in view of the need to find better methods of controlling 
inflammation brought about by various disease states. 

15 One means of assessing the response of T lymphocytes to antigens, as well as 

determining differences between diseased lymphocytes versus normal, is to measure 
the ability of T lymphocytes to synthesize specific RNA de novo either upon contact 
with an antigen, allergen or pathogen or the state of RNA message in T lymphocytes 
found in different T lymphocyte disorders. 

20 The following discussion presents a general description of the invention as 

well definitions for certain terms used herein. 

Definitions 

As used herein, "T lymphocyte population", also referred to as a "T cell 
population", refers to any population of T lymphocytes obtained from a variety of 
25 sources, such as mammalian {e.g., human) spleen, tonsils and peripheral blood. See 
Lewis et al. y (1988) Proc. Natl. Acad. Sci., 85: 9743-9747. T-cell clones, such as a 
Jurkat cell line, may also be used. Such T lymphocyte clones are numerous and 
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commonly available (see Research Monographs in Immunology, eds. von Doehmer, 
H. and Haaf, V.; Section D: "Human T-Cell Clones", vol. 8, pgs. 243-333; Elsevier 
Science Publishers, N.Y.[1985]). 

As used herein, "quiescent T lymphocytes" refers to resting T lymphocytes 
that have not been activated by exposure to an activating agent, pathogen, mitogen, 
immunogen, allergen, antigen or superantigen or any other agent that induces a 
change in T lymphocyte mRNA expression. 

As used herein, "activated T lymphocytes" refers to T lymphocytes which 
have been exposed to an activating agent, pathogen, mitogen, immunogen, allergen, 
antigen, superantigen or any other agent that induces a change in T lymphocyte 
mRNA expression. 

As used herein, "T H1 " refers to the subpopulation of T helper cells which 
secrete IL-2 and interferon y (IFN-y), but not IL-4. T H1 cells also secrete IL-3 and 
granulocyte-monocyte colony-stimulating factor (GM-CSF). 

As used herein, "T H2 n refers to the subpopulation of T helper cells which 
secrete IL-4, IL-5, but not IL-2 or IFN-y. T H2 cells also secrete IL-3 and granulocyte- 
monocyte colony-stimulating factor (GM-CSF). 

As used herein, "T DTH refers to the subpopulation of T lymphocytes that 
initiate delayed hypersensitivity reactions and express, amongst others, the following 
markers: TCR, CD4, IL-2, and IL-3. 

As used herein, T CTL refers to the subpopulation of T lymphocytes that lyse 
specific target cells. T CTL cells express amongst other markers: TCR and CD8. 

As used herein, T s refers to the subpopulation of T cells that suppress immune 
responses; these cell express amongst other markers: TCR and CD8. 

As used herein, "cortical T cells" refers to T cells which express CD7, CDS, 
CD2, and CD38. 

As used herein, "medullary T cells" refers to T cells which express CD1, CD3, 
CD4, CD8, CDS, CD2, CD7 and CD38. 
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As used herein, "peripheral T lymphocytes" refers to T lymphocytes which 
circulate through the blood and other non-thymus lymphoid organs and express either 
CD4 or CD8. 

As used herein, "NK cells" refers to a class of lymphocytes that do not bear 
5 markers for either T lymphocytes or B cells, but includes natural killer (NK) cells. 
"NK cells" express amongst other markers FcR, CD 16, perforins and granzymes. 

As used herein, "TIL cells" refers to tumor infiltrating lymphocytes extracted 
from tumor tissue. 

As used herein, "LAK cells" refers to lymphokine activated killer cells that do 
10 not bear T lymphocyte markers, are not MHC restricted, lyse cell lines resistant to 
natural killer (NK) cell killing, and are effective in reducing large tumor cell masses 
in mice. Stewart Sell, 919 (1996). 

As used herein, the term "antigen" refers to a substance that elicits an immune 
response and can include superantigens. 
15 As used herein, the term "sterile inflammatory disease" refers to any 

inflammatory disease caused by immune or non-immune mechanisms not directly 
linked to infection (see Stewart Sell etal., 1996). Examples of sterile 
inflammatory diseases include, but are not limited to psoriasis, rheumatoid arthritis, 
glomerulonephritis, asthma, cardiac and renal reperfusion injury, thrombosis, adult 
20 respiratory distress syndrome, inflammatory bowel diseases, such as Crohn's disease 
and ulcerative colitis and periodontal disease. 

As used herein, the term "mitogen" refers to a class of substances that 
stimulate lymphocytes to proliferate independently of antigen. 

As used herein, the term "superantigen" refers to a special class of antigens, 
25 defined by their capacity to stimulate a large fraction of T lymphocytes. 

Superantigens include several "enterotoxins", which are globular proteins released by 
such bacteria as Staphylococcus aureus. 

As used herein, the term "pathogen" refers to any infectious organism 
including bacteria, viruses, parasites, mycoplasma, protozoans, and fungi (including 
30 molds and yeast). "Pathogenic bacteria" include, but are not limited to Staphylococci 
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(e.g., aureus), Streptococci (e.g., pneumoniae) , Clostridia (e.g., perfringens), 
Neisseria (e.g., gonorrhoeae), Enterobacteriaceae (e.g., E. coli as well as Klebsiella, 
Salmonella, Shigella, Yersinia and Proteus), Helicobacter (e.g., pylori), Vibrio (e.g., 
cholerae), Campylobacter (e.g., jejuni), Pseudomonas (e.g., aeruginosa), 
5 Haemophilus (e.g., influenzae), Bordetella (e.g., pertussis), Mycoplasma (e.g., 
pneumoniae), Ureaplasma (e.g., urealyticum), Legionella (e.g., pneumophila), 
Spirochetes (e.g., Treponema, Leptospira and Borrelia), Mycobacteria (e.g., 
tuberculosis, smegmatis), Actinomyces (e.g., israelii), Nocardia (e.g., asteroides), 
Chlamydia (e.g., trachomatis), Rickettsia, Coxiella, Ehrilichia, Rochalimaea, 

10 Brucella, Yersinia, Fracisella t Mycobacterium leprae, and Pasteurella. 

"Pathogenic viruses" include amongst others: human T lymphocyte virus type 
I and II (HTLV-I and HTLV-II), Epstein-Barr Virus (EBV), group C adenoviruses, 
herpes simplex virus, cytomegalovirus, poliovirus, rubella, measles, mumps 
respiratory syncytial virus (RSV), vesicular stomatitis virus (VSV), HIV-1, rabies 

15 virus, influenza A, parainfluenza, and lymphocytic choriomeningitis virus. 

As used herein, "immunodeficiency diseases" or "immunodeficiency 
disorders" include both acquired immunodeficiency and primary immunodeficiency 
disorders. Primary immunodeficiency disorders encompass: antibody deficiency 
disorders (e.g., sex linked agammaglobulinemia, common variable 

20 immunodeficiency, selective IgA deficiency, immunodeficiency with elevated IgM, 
transient hypogammaglobulinemia of infancy, antibody deficiency with near normal 
immunoglobulins, and X-linked lymphoproliferative disease); cellular 
immunodeficiency disorders (e.g., thymic hypoplasia, and Nezelof s syndrome); and 
severe combined immunodeficiency (SCID) disorders (e.g., autosomal recessive 

25 severe combined immunodeficiency disease, X-linked recessive severe combined 

immunodeficiency disease, defective expression of major histocompatibility complex 
antigens, and severe combined immunodeficiency with leukopenia). 

The phrase "solid support" refers to any support to which nucleic acids can be 
bound or immobilized, including nitrocellulose, nylon, glass, other solid supports 



WO 99/57130 PCT/US99/09761 

-13- 

which are positively charged, including nanochannel glass arrays disclosed by Beattie 
(W095/1175). 

The phrase "specifically hybridizes" refers to nucleic acids which hybridize 
under highly stringent or moderately stringent conditions to the nucleic acids 

5 containing at least one of the sequences identified in the Figures. In referencing the 
Figures, we mean at all points the Figures and the discussion of the Figures found in 
the Brief Description of Drawings. Preferably such specifically hybridizing nucleic 
acids will present a clear and detectable signal, and they may be labeled by various 
means that are known to those skilled in the art. 

10 The phrase "isolated nucleic acid" refers to nucleic acids that have been 

separated from contaminant nucleic acids encoding other polypeptides. "Nucleic 
acids" refers to all forms of DNA and RNA, including cDNA molecules and antisense 
RNA molecules. 

The phrase "sequences contiguous with" refers to sequences which are 
15 covalently linked to a given nucleic acid sequence or fragment at either the 5 f or 3* end 
through phospho-diester bonds. Such contiguous sequences are contained within a 
single molecule with a given nucleic acid sequence or fragment, such as a cDNA 
molecule. 

The phrase "gene expression profile", also referred to as a "differential 
20 expression profile" or "expression profile", refers to a representation of the expression 
of mRNA species in a cell sample or population. For instance, a gene expression 
profile can refer to an autoradiograph of labeled cDNA fragments produced from total 
cellular mRNA separated on the basis of size by known procedures. Such procedures 
include slab gel electrophoresis, capillary gel electrophoresis, high performance liquid 
25 chromatography (HPLC), and the like. Digitized representations of scanned 

electrophoresis gels are also included as are two and three dimensional representations 
of the digitized data. 

While a gene expression profile encompasses a representation of the 
expression level of at least one mRNA species, in practice, the typical gene expression 
30 profile represents the expression level of multiple mRNA species. For instance, a 
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gene expression profile useful in the methods and compositions disclosed herein 
represents the expression levels of at least about 1, 5, 10, 20, 50, 100, 150, 200, 300, 
500, 1000, 10,000, 50,000 or, more preferably, substantially all of the detectable 
mRNA species in a cell sample or population. Particularly preferred are gene 

5 expression profiles or arrays affixed to a solid support that contain a representative 
number of mRNA species, whose expression levels are modulated under the relevant 
infection, disease, screening, treatment or other experimental conditions. In some 
instances, a sufficient representative number of such mRNA species will be at least 
about 1, 2, 5, 10, 15, 20, 25, 30, 40, 50, 50-75 or 100. The expression profiles 

10 preferably will also be able to quantitatively differentiate the relative quantities of 
mRNA species in a cell. 

Gene expression profiles can be produced by any means known in the art, 
including, but not limited to the methods disclosed by: Liang et al t (1992) Science 
257: 967-971; Ivanova et al. 9 (1995) Nucleic Acids Res. 23: 2954-2958; Guilfoyl et 

15 aL, (1997) Nucleic Acids Res. 25(9): 1854-1858; Chee et al, (1996) Science 274: 610- 
614; Velculescu et al. t (1995) Science 270: 484-487; Fischer etal, (1995) Proc. Natl 
Acad. Sci. USA 92(12): 5331-5335; andKato, (1995) Nucleic Acids Res. 23(18): 
3685-3690. Gene expression profiles also may be produced by the methods of 
Belyavsky et al y in U.S. Patent Application Serial No. 08/499,899. Preferably, gene 

20 expression profiles are produced by the methods of Prashar et aL, (WO 97/05286) and 
Prashar et al t (1996) Proc. Natl. Acad. Sci. USA 93: 659-663. 

As an example, gene expression profiles as described herein are made to 
identify one or more genes whose expression levels are modulated in a T lymphocyte 
population exposed to a pathogen, allergen, antigen, mitogen or superantigen, or a T 

25 lymphocyte population isolated, for example, from a subject having a sterile 

inflammatory disease, GVHD or cancer. The assaying of the modulation of gene 
expression via the production of a gene expression profile generally involves the 
production of cDNA from polyA RNA (mRNA) isolated from T lymphocytes as 
described below. 
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The mRNAs are isolated from a T lymphocyte source (e.g., prothymocytes or 
T H2 cells) The cells may be obtained from an in vivo source, such as a peripheral 
blood or thymus. As is apparent to one skilled in the art, any lymphocyte type (e.g., 
plasma cells, B cells, T lymphocytes or T lymphocyte subpopulations) may be used, 

5 however, T lymphocytes are preferred. Furthermore, the peripheral blood cells that 
are initially obtained may be subjected to various separation techniques (e.g., flow 
cytometry, density gradients) to purify specific T lymphocyte subpopulations (e.g., 
Tdth> T ctl , T H2 , etc.). 

"mRNAs" are isolated from cells by any one of a variety of techniques. 

10 Numerous techniques are well known (see e.g., Sambrook etal., Molecular 
Cloning: A Laboratory Approach, Cold Spring Harbor Press, NY, (1987); 
Ausubel etal., Current Protocols in Molecular Biology, Greene Publishing 
Co. NY, (1995)). In general, these techniques first lyse the cells and then enrich for or 
purify RNA. In one such protocol, cells are lysed in a Tris-buffered solution 

15 containing sodium dodecyl sulfate. The lysate is extracted with phenol/chloroform, 
and nucleic acids are precipitated. Purification of poly(A)-containing RNA is not a 
requirement. The mRNAs may, however, be purified from crude preparations of 
nucleic acids or from total RNA by chromatography, such as binding and elution from 
oligo(dT)-cellulose or poly(U)-Sepharose® . As stated above, other protocols and 

20 methods for isolation of RNAs may be substituted. 

The mRNAs are reverse transcribed using an RNA-directed DNA polymerase, 
such as "reverse transcriptase" isolated from such retroviruses as AMV, MoMuLV or 
recombinantly produced. Many commercial sources of enzyme are available (e.g., 
Pharmacia, New England Biolabs, Stratagene Cloning Systems). Suitable buffers, 

25 cofactors, and conditions are well known and supplied by various manufacturers (see 
also, Sambrook etal., (1989) supra; Ausubel etal., (1995) supra). 

Various oligonucleotides are used in the production of cDNA. In particular, 
the methods described herein utilize oligonucleotide primers for cDNA synthesis, and 
adapters, and primers for amplification. Oligonucleotides are generally synthesized as 

30 single strands by standard chemistry techniques, including automated synthesis. 
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Oligonucleotides are subsequently de-protected and may be purified by precipitation 
with ethanol, chromatographed using a sized or reversed-phase column, denaturing 
polyacrylamide gel electrophoresis, high-pressure liquid chromatography (HPLC), or 
other suitable method. In addition, within certain preferred embodiments, a functional 
5 group, such as biotin, is incorporated preferably at the 5 1 or 3' terminal nucleotide. A 
biotinylated oligonucleotide may be synthesized using pre-coupled nucleotides, or 
alternatively, biotin may be conjugated to the oligonucleotide using standard chemical 
reactions known to individuals skilled in the art. Other functional groups, such as 
florescent dyes, radioactive molecules, digoxigenin, and the like, may also be used. 
10 Partially-double stranded adaptors are formed from single-stranded 

oligonucleotides by annealing complementary single-stranded oligonucleotides that 
are chemically synthesized or by enzymatic synthesis. Following synthesis of each 
strand, the two oligonucleotide strands are mixed together in a buffered salt solution 
(e.g., 1 M NaCl; 100 mM Tris-HCl, pH 8.0; 10 mM EDTA) or in a buffered solution 
15 containing Mg +2 {e.g., 10 mM MgCl 2 ) and annealed by heating to high temperature 
and slow cooling to room temperature. 

The oligonucleotide primer that primes first strand DNA synthesis comprises a 
5 1 sequence incapable of hybridizing to a polyA tail of the mRNAs, and a 3* sequence 
that hybridizes to a portion of the polyA tail of the mRNAs and at least one non- 
20 polyA nucleotide immediately upstream of the polyA tail. The 5 f sequence is 

preferably of a sufficient length such that can serve as a primer for amplification. The 
5* sequence also preferably has an average G+C content and does not contain large 
palindromic sequences; certain palindromes, such as a recognition sequence for a 
restriction enzyme, may be acceptable. Examples of suitable 5* sequences are 
25 CTCTCAAGGATC:TACCGCT (SEQ ID NO.35), CAGGGTAGACGACGCTACGC 
(SEQ ID NO.36), and TAATACCGCGCCACATAGCA (SEQ ID NO.37). 

The 5' sequence is joined to a 3' sequence comprising a sequence that 
hybridizes to a portion of the polyA tail of mRNAs and at least one non-polyA 
nucleotide immediately upstream. Although the polyA-hybridizing sequence is 
30 typically a homopolymer of dT or dU, it need only contain a sufficient number of dT 
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or dU bases to hybridize to polyA under the conditions employed. Both oligo-dT and 
oligo-dU primers have been used and give comparable results. Thus, other bases may 
be interspersed or concentrated, as long as hybridization is not impeded. Typically, 
12 to 18 bases or 12 to 30 bases of dT or dU will be used. However, as one skilled in 
5 the art appreciates, the length need only be sufficient to obtain hybridization. The 
non-polyA nucleotides are A, C, or G, or a nucleotide derivative, such as inosinate. If 
one non-polyA nucleotide is used, then three oligonucleotide primers are needed to 
hybridize to all mRNAs. If two non-polyA nucleotides are used, then 12 primers are 
needed to hybridize to all mRNAs (AA, AC, AG, AT, CA, CC, CG, CT, GA, GC, 
10 GG, GT). If three non-poly A nucleotides are used, then 48 primers are needed (3X4 
X 4). Although there is no theoretical upper limit on the number of non-polyA 
nucleotides, practical considerations make the use of one or two non-polyA 
nucleotides preferable. 

For cDNA synthesis, the mRNAs are either subdivided into three (if one non- 
15 polyA nucleotide is used) or twelve (if two non-polyA nucleotides are used) fractions, 
each containing a single oligonucleotide primer, or the primers may be pooled and 
contacted with a mRNA preparation. Other subdivisions may alternatively be used. 
Briefly, first strand cDNA is initiated from the oligonucleotide primer by reverse 
transcriptase (RTase). As noted above, RTase may be obtained from numerous 
20 sources and protocols are well known. Second strand synthesis may be performed by 
RTase (Gubler and Hoffman, Gene 25: 263, 1983), which also has a DNA-directed 
DNA polymerase activity, with or without a specific primer, by DNA polymerase 1 in 
conjunction with RNaseH and DNA ligase, or other equivalent methods. The double- 
stranded cDNA is generally treated with phenol xhloroform extraction and ethanol 
25 precipitation to remove protein and free nucleotides. 

Double-stranded cDNA is subsequently digested with an agent that cleaves in 
a sequence-specific manner. Such cleaving agents include restriction enzymes. 
Restriction enzyme digestion is preferred; enzymes that are relatively infrequent 
cutters (e.g., ^ 5 bp recognition site) are more preferred, and those that leave 
30 overhanging ends are especially preferred. A restriction enzyme with a six base pair 
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recognition site cuts approximately 8% of cDNAs, so that approximately 12 such 
restriction enzymes should be needed to digest every cDNA at least once. By using 
30 restriction enzymes, digestion of every cDNA is assured. 

The adapters for use in the present invention are designed such that the two 
5 strands are only partially complementary and only one of the nucleic acid strands that 
the adapter is ligated to can be amplified. Thus, the adapter is partially double- 
stranded (i.e., comprising two partially hybridized nucleic acid strands), wherein 
portions of the two strands are non-complementary to each other and portions of the 
two strands are complementary to each other. Conceptually, the adapter is "Y- 
10 shaped" or "bubble-shaped." When the 5* region is non-paired, the 3 1 end of other 
strand cannot be extended by a polymerase to make a complementary copy. The 
ligated adapter can also be blocked at the 3' end to eliminate extension during 
subsequent amplifications. Blocking groups include dideoxynucleotides or any other 
agent capable of blocking the 3 '-OH. In this type of adapter ("Y-shaped"), the non- 
15 complementary portion of the upper strand of the adapters is preferably a length that 
can serve as a primer for amplification. As noted above, the non-complementary 
portion of the lower strand need only be one base, however, a longer sequence is 
preferable (e.g., 3 to 20 bases; 3 to 15 bases; 5 to 15 bases; or 14 to 24 bases). The 
complementary portion of the adapter should be long enough to form a duplex under 
20 conditions of litigation. 

For "bubble-shaped" adapters, the non-complementary portion of the upper 
strands is preferably a length that can serve as a primer for amplification. Thus, this 
portion is preferably 15 to 30 bases. Alternatively, the adapter can have a structure 
similar to the Y-shaped adapter, but has a 3 r end that contains a moiety that a DNA 
25 polymerase cannot extend from. 

Amplification primers are also used in the present invention. Two different 
amplification steps are performed in the preferred aspect. In the first, the 3' end 
(referenced to mRNA) of double stranded cDNA that has been cleaved and ligated 
with an adapter is amplified. For this amplification, either a single primer or a primer 
30 pair is used. The sequence of the single primer comprises at least a portion of the 5' 
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sequence of the oligonucleotide primer used for first strand cDNA synthesis. The 
portion need only be long enough to serve as an amplification primer. The primer pair 
consists of a first primer whose sequence comprises at least a portion of the 5 f 
sequence of the oligonucleotide primer as described above; and a second primer 

5 whose sequence comprises at least a portion of the sequence of one strand of the 
adapter in the non-complementary portion. The primer will generally contain all the 
sequence of the non-complementary potion, but may contain less of the sequence, 
especially when the non-complementary portion is very long, or more of the sequence, 
especially when the non-complementary portion is very short. In some embodiments, 

10 the primer will contain a sequence of the complementary portion, as long as that 
sequence does not appreciably hybridize to the other strand of the adapter under the 
amplification conditions employed. For example, in one embodiment, the primer 
sequence comprises four bases of the complementary region to yield a 19 base primer, 
and amplification cycles are performed at 56° C (annealing temperature), 72 °C 

15 (extension temperature), and 94 °C (denaturation temperature). In another 
embodiment, the primer is 25 bases long and has 10 bases of sequence in the 
complementary portion. Amplification cycles for this primer are performed at 68 °C 
(annealing and extension temperature) and 94 °C (denaturation temperature). By 
using these longer primers, the specificity of priming is increased. 

20 The design of the amplification primers will generally follow well-known 

guidelines, such as average G-C content, absence of hairpin structures, inability to 
form primer-dimers and the like. At times, however, it will be recognized to those 
skilled in the art that deviations from such guidelines may be appropriate or desirable. 
After amplification, the lengths of the amplified fragments are determined. 

25 Any procedure that separate nucleic acids on the basis of size and allows detection or 
identification of the nucleic acids is acceptable. Such procedures include slab gel 
electrophoresis, capillary gel electrophoresis, high performance liquid 
chromatography (HPLC), and the like. 

Electrophoresis is technique based on the mobility of DNA in an electric field. 

30 Negatively charged DNA migrates towards a positive electrode at a rate dependent on 
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its total charge, size, and shape. Most often, DNA is electrophoresed in agarose or 
polyacrylamide gels. For maximal resolution, polyacrylamide is preferred and for 
maximal linearity, a denaturant, such as urea is present. A typical get setup uses a 
19:1 mixture of acrylamiderbisacrylamide and a Tris-borate buffer. DNA samples are 

5 denatured and applied to the gel, which is usually sandwiched between glass plates. A 
typical procedure can be found in Sambrook etal., (1989) or Ausubel etal., 
(1995). Variations may be substituted as long as sufficient resolution is obtained. 

Capillary electrophoresis, (CE) in its various manifestations (e.g., free 
solution, isotachophoresis, isoelectric focusing, polyacrylamide gel, micellar 

10 electrokinetic "chromatography"), allows high resolution separation of very small 
sample volumes. Briefly, in capillary electrophoresis, a neutral coated capillary, such 
as a 50 /^m X 37 cm column (eCAP neutral, Beckman Instruments, CA), is filled with 
a linear polyacrylamide (e.g., 0.2% polyacrylamide), a sample is introduced by high- 
pressure injection followed by an injection of running buffer (e.g., IX TBE). The 

15 sample is electrophoresed and fragments are detected. An order of magnitude 

increase can be achieved with the use of capillary electrophoresis. Capillaries may be 
used in parallel for increased throughput (Smith et al. f (1990) Nuc. Acids, Res. 18: 
4417; Mathies and Huang, (1992) Nature 359: 167). Because of the small sample 
volume that can be loaded onto a capillary, the sample may be concentrated to 

20 increase the level of detection. One means of concentration is sample stacking (Chien 
and Burgi, (1992) Anal. Chem. 64: 489 A). In sample stacking, a large volume of 
sample in a low concentration buffer is introduced to the capillary column. The 
capillary is then filled with a buffer of the same composition, but at higher 
concentration, such that when the sample ions reach the capillary buffer with a lower 

25 electric field, they stack into a concentrated zone. Sample stacking can increase 

detection by one to three orders of magnitude. Other methods of concentration, such 
as isotachophoresis, may also be used. 

High-performance liquid chromatography (HPLC) is a chromatographic 
separation technique that separates compounds in solution. HPLC instruments consist 

30 of a reservoir of mobile phase, a pump, an injector, a separation column, and a 



WO 99/57130 PCT/US99/09761 

-21- 

detector. Compounds are separated by injecting an aliquot of the sample mixture onto 
the column. The different components in the mixture pass through the column at 
different rates due to differences in their partitioning behavior between the mobile 
liquid phase and the stationary phase. IP-RO-HPLC on non-porous PS/DVB particles 

5 with chemically bonded alkyl chains can also be used to analyze nucleic acid 

molecules on the basis of size (Huber et al., (1993) Anal Biochem. 121: 351; Huber et 
al, (1993) Nuc. Acids Res. 21: 1061; Huber et al> (1993) Biotechniques 16: 898). 

In each of these analysis techniques, the amplified fragments are detected. A 
variety of labels can be used to assist in detection. Such labels include, but are not 

10 limited to, radioactive molecules (e.g., 35 S, 32 P, 33 P), fluorescent molecules, and mass 
spectrometric tags. The labels may be attached to the oligonucleotide primers or to 
nucleotides that are incorporated during DNA synthesis, including amplification. 

Radioactive nucleotides may be obtained from commercial sources; 
radioactive primers may be readily generated by transfer of label from y- 32 P-ATP to a 

15 5-OH group by a kinase (e.g., T4 polynucleotide kinase). Detection systems include 
autoradiograph, phosphor image analysis and the like. 

Fluorescent nucleotides may be obtained from commercial sources (e.g., ABI, 
Foster City, CA) or generated by chemical reaction using appropriately derivatized 
dyes. Oligonucleotide primers can be labeled, for example, using succinimidyl esters 

20 to conjugate to amine-modified oligonucleotides. A variety of florescent dyes may be 
used, including 6 carboxyfluorescein, other carboxyfluorescein derivatives, 
carboxyrhodamine derivatives, Texas red derivatives, and the like. Detection systems 
include photomultiplier tubes with appropriate wave-length filters for the dyes used. 
DNA sequence analysis systems, such as produced by ABI (Foster City, CA), may 

25 also be used. 

After separation of the amplified cDNA fragments, cDNA fragments which 
correspond to differentially expressed mRNA species are isolated, reamplified and 
sequenced according to standard procedures. For instance, bands corresponding the 
cDNA fragments can be cut from the electrophoresis gel, reamplified and subcloned 
30 into any available vector, including pCRscript using the PCR script cloning kit 
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(Stratagene). The insert is then sequenced using standard procedures, such as cycle 
sequencing on an ABI sequencer. 

An additional means of analysis comprises hybridization of the amplified 
fragments to one or more sets of oligonucleotides or nucleic acid fragments 

5 immobilized on a solid substrate. Historically, the solid substrate is a membrane, such 
as nitrocellulose or nylon. More recently, the substrate is a silicon wafer or a 
borosilicate slide. The substrate may be porous (Beattie et al., WO 95/1 1755) or 
solid. Oligonucleotides are synthesized in situ or synthesized prior to deposition on 
the substrate. Various chemistries are known for attaching oligonucleotide. Many of 

10 these attachment chemistries rely upon ftmctionalizing oligonucleotides to contain a 
primary amine group. The oligonucleotides are arranged in an array form, such that 
the position of each oligonucleotide sequence can be determined. 

The amplified fragments, which are generally labeled according to one of the 
methods described herein, are denatured and applied to the oligonucleotides on the 

15 substrate under appropriate salt and temperature conditions. In certain embodiments, 
the conditions are chosen to favor hybridization of exact complementary matches and 
disfavor hybridization of mismatches. Unhybridized nucleic acids are washed off and 
the hybridized molecules detected, generally both for position and quantity. The 
detection method will depend upon the label used. Radioactive labels, fluorescent 

20 labels and mass spectrometry label are among the suitable labels. 

The present invention as set forth in the specific embodiments, includes 
methods to identify a therapeutic agent that modulates the expression of at least one 
gene in a T lymphocyte population. Genes which are differentially expressed during 
T lymphocyte contact with a pathogen, antigen, superantigen, mitogen or allergen, or 

25 that are differentially expressed in a subject having a sterile inflammatory disease, 
cancer or GVHD are of particular importance. 

In general, the method to identify a therapeutic or prophylactic agent that 
modulates the response of a T lymphocyte population to a pathogen, mitogen, antigen, 
superantigen or allergen, comprises the steps of preparing a first gene expression 

30 profile of a quiescent T lymphocyte population or subpopulation, preparing a second 
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gene expression profile of a T lymphocyte population or subpopulation exposed to a 
pathogen, mitogen, antigen, superantigen or allergen, treating the exposed T 
lymphocyte population or subpopulation with the agent, preparing a third gene 
expression profile of the treated T lymphocyte population, comparing the first, second 

5 and third gene expression profiles and identifying agents that modulate the response 
of a T lymphocyte population or subpopulation to the pathogen, mitogen, antigen, 
superantigen or allergen. 

In another format, the method is used to identify a therapeutic agent that 
modulates the expression of genes in a T lymphocyte population or subpopulation 

10 found in a human or animal subject having a sterile inflammatory disease, GVHD or 
cancer. The general method comprises the steps of preparing a first gene expression 
profile of a T lymphocyte population or subpopulation in a subject having a sterile 
inflammatory disease GVHD or cancer, treating or exposing the T lymphocyte 
population or subpopulation to the agent, preparing a second gene expression profile 

15 of the treated T lymphocyte population or subpopulation, comparing the first and 
second gene expression profiles with the gene expression profile of a normal T 
lymphocyte population and identifying agents that modulate the expression of genes 
whose transcription levels are altered in the T lymphocyte population or 
subpopulation of the subject as compared with normal T lymphocyte population or 

20 subpopulation. 

While the above methods for identifying a therapeutic agent comprise the 
comparison of gene expression profiles from treated and not-treated T lymphocytes, 
many other variations are immediately envisioned by one of ordinary skill in the art. 
As an example, as a variation of a method to identify a therapeutic or prophylactic 

25 agent that modulates the response of a T lymphocyte population or subpopulation to 
pathogen, mitogen, antigen, superantigen or allergen, the second gene expression 
profile of a T lymphocyte population or subpopulation exposed to a pathogen, 
antigen, mitogen, immunogen or allergen and the third gene expression profile of the 
treated T lymphocyte population or subpopulation can each be independently 

30 normalized using the first gene expression profile prepared from a quiescent T 
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lymphocyte population or subpopulation. Normalization of the profiles can easily be 
achieved by scanning autoradiographs corresponding to each profile, and subtracting 
the digitized values corresponding to each band on the autoradiograph from quiescent 
T lymphocytes from the digitized value for each corresponding band on 
5 autoradiographs corresponding to the second and third gene expression profiles. After 
normalization, the second and third gene expression profiled can be compared directly 
to detect cDNA fragments which correspond to mRNA species, which are 
differentially expressed upon exposure of the T lymphocyte population or 
subpopulation to the agent to be tested. 



1 0 Nucleic A cid Fragments 

Nucleic acids of the claimed invention include nucleic acids which specifically 
hybridize to nucleic acids comprising the sequences identified in the Figures. A 
nucleic acid which specifically hybridizes to a nucleic acid comprising one of the 
sequences identified in the Figures remains stably bound to said nucleic acid under 

15 highly stringent or moderately stringent conditions. Stringent and moderately 

stringent conditions are those commonly defined and available, such as those defined 
by Sambrook et al., (1989) or Ausubel et al., (1995). The precise level of stringency 
is not important, rather, conditions should be selected that provide a clear, detectable 
signal when specific hybridization has occurred. 

20 Hybridization is a function of sequence identity (homology), G+C content of 

the sequence, buffer salt content, sequence length and duplex melt temperature (T m ) 
among other variables. See, Maniatis et al., ( Molecular Cloning, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y. 1982). With similar sequence lengths, 
the buffer salt concentration and temperature provide useful variables for assessing 

25 sequence identity (homology) by hybridization techniques. For example, where there 
is at least 90 percent homology, hybridization is commonly carried out at 68°C in a 
buffer salt such as 6X SCC diluted from 20X SSC. See Sambrook et al., (1989) The 
buffer salt utilized for final Southern blot washes can be used at a low concentration, 
e.g., 0.1X SSC and at a relatively high temperature, e.g., 68°C, and two sequences will 
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form a hybrid duplex (hybridize). Use of the above hybridization and washing 
conditions together are defined as conditions of high stringency or highly stringent 
conditions. Moderately stringent conditions can be utilized for hybridization where 
two sequences share at least about 80 percent homology. Here, hybridization is 

5 carried out using 6X SSC at a temperature of about 50-55°C. A final wash salt 
concentration of about 1-3X SSC and at a temperature of about 60-68°C are used. 
These hybridization and washing conditions define moderately stringent conditions. 

In particular, specific hybridization refers to conditions in which a high degree 
of complementarity exists between a nucleic acid comprising the sequences identified 

10 in at least one of the Figures and another nucleic acid. With specific hybridization, 
complementarity will generally be at least about 75%, 80%, 85%, preferably about 
90-100%, or most preferably about 95-100%. 

The nucleic acids of the present invention can be used in a variety of ways in 
accordance with the present invention. For example, they can be used as nucleic acid 

15 probes to screen other cDNA and genomic DNA libraries so as to select by 

hybridization other DNA sequences that comprise similar sequences. The nucleic 
acid probe could be RNA or DNA labeled with radioactive nucleotides or by 
non-radioactive methods (Le., biotin). Screening could be done at various 
stringencies (through manipulation of the hybridization T m , usually using a 

20 combination of ionic strength, temperature and/or presence of formamide) to isolate 
close or distantly related homologs. The nucleic acids may also be used to generate 
primers to amplify cDNA or genomic DNA using polymerase chain reaction (PCR) 
techniques. The nucleic acid sequences of the present invention can also be used to 
identify adjacent sequences in the cDNA or genome, for 

25 example, flanking sequences and regulatory elements. 

The nucleic acid sequences of the present invention can also be used 
diagnostically to detect nucleic acid sequences which specifically hybridize to at least 
one of the sequences identified in the Figures. For instance, said sequences can be 
used to detect activated T lymphocytes or T lymphocytes previously exposed to a 

30 specific pathogen, mitogen, antigen, superantigen or allergen. As set forth in the 
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Examples, said sequences are the partial sequences of cDNA species which 
correspond to T lymphocyte mRNA and therefore genes, which are differentially 
expressed during T lymphocyte contact with a specific pathogen, mitogen, antigen, 
superantigen or allergen. Nucleic acid fragments comprising at least part of these 
5 sequences may be used as diagnostic probes to identify T lymphocyte populations 
that have been activated or have been in contact with a specific pathogen, mitogen, 
antigen, superantigen or allergen. 

In general, nucleic acid fragments comprising at least one of the sequences or 
part of one of the sequences identified in the Figures can be used as probes to screen 

10 nucleic acid samples from T lymphocyte populations in hybridization assays. Such 
assays can be used to detect activated T lymphocytes or T lymphocytes exposed to a 
specific pathogen, mitogen, antigen, superantigen or allergen. To ensure specificity of 
a hybridization assay using a probe derived from such sequences, it is preferable to 
design probes which hybridize only with target nucleic acid under conditions of high 

15 stringency. Only highly complementary nucleic acid hybrids form under conditions 
of high stringency. Accordingly, the stringency of the assay conditions determines 
the amount of complementarity which should exist between two nucleic acid strands 
in order to form a hybrid. Stringency should be chosen to maximize the difference in 
stability between the probe:target hybrid and potential probemon-target hybrids. 

20 Probes may be designed from the sequences of the Figures through methods 

known in the art. For instance, the G+C content of the probe and the probe length can 
affect probe binding to its target sequence. Methods to optimize probe specificity are 
commonly available in Sambrook et al^ (1989), or Ausubel et al., (1995). 

The nucleic acid sequences of the present invention can also be used as probes 

25 to monitor the expression of at least one differentially expressed T lymphocyte gene 
in a method to identify a therapeutic or prophylactic agent that modulates the response 
of a T lymphocyte population to a specific pathogen, mitogen, antigen, superantigen 
or allergen. In general, the method to identify a therapeutic or prophylactic agent that 
modulates the response of a T lymphocyte population to a specific pathogen, mitogen, 

30 antigen, superantigen or allergen comprises the steps of determining the expression 
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level of at least one RNA species that specifically hybridizes to a probe comprising all 
or part of at least of one of the sequences identified in the Figures in a quiescent T 
lymphocyte population. The expression level of the RNA species is then determined 
in a T lymphocyte population exposed to a specific pathogen, mitogen, antigen, 

5 superantigen or allergen and also in a T lymphocyte population exposed to a specific 
pathogen, mitogen, antigen, superantigen or allergen and to the agent to be tested. 
Agents which modulate the expression level of a RNA species associated with T 
lymphocyte activation by a specific pathogen, mitogen, antigen, superantigen or 
allergen are thereby identified by comparing the expression levels of the RNA 

10 species. 

Hybridization assays to determine the expression level of at least one RNA 
species are commonly available and include the detection of DNA:RNA and 
RNA:RNA hybrids. Northern blots of total cellular RNA or polyA purified RNA and 
hybridization assays wherein at least one or part of one of the sequences of the present 

15 invention are immobilized to a solid support are included. 

Solid supports can be prepared that comprise immobilized representative 
groupings of nucleic acids corresponding to the sequences or parts of the sequences 
identified in the Figures. For instance, representative nucleic acids can be 
immobilized to any solid support to which nucleic acids can be immobilized, such as 

20 positively charged nitrocellulose or nylon membranes (see Sambrook et aL 9 1989), as 
well as porous glass wafers such as those disclosed by Beattie (WO 95/1 1755). 
Nucleic acids are immobilized to the solid support by well established techniques, 
including charge interactions, as well as attachment of derivatized nucleic acids to 
silicon dioxide surfaces such as glass which bears a terminal epoxide moiety. A solid 

25 support comprising a representative grouping of nucleic acids can then be used in 
standard hybridization assays to detect the presence or quantity of one or more 
specific nucleic acid species in a sample (such as a total cellular mRNA sample or 
cDNA prepared from said mRNA) which hybridize to the nucleic acids attached to 
the solid support. Any hybridization methods, reactions, conditions and/or detection 
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means can be used, such as those disclosed by Sambrook et ai y (1989), Ausbel et ai, 
(1987), or Beattie (WO 95/1 1755). 

One of ordinary skill in the art may determine the optimal number of nucleic 
acid species that must be represented by nucleic acid fragments immobilized on the 
5 solid support to effectively differentiate between samples, e.g., T lymphocytes 
exposed to a specific pathogen, mitogen, antigen, superantigen or allergen. 
Preferably, at least about 1, 5, 10, 20, 28 or more nucleic acid fragments 
corresponding to at least one or part of one of the sequences identified in the Figures 
are affixed to a solid support. The skilled artisan will be able to optimize the number 
10 and particular nucleic acids for a given purpose, i.e., screening for modulating agents, 
identifying activated T lymphocytes, etc. 



Full Length cDNA Fragments 

The determination of the sequences identified in the Figures as derived from T 
lymphocyte mRNA species (genes) that are differentially regulated in response to 

15 exposure of a T lymphocyte population to a specific pathogen, mitogen, antigen, 

superantigen or allergen enables the isolation of full length cDNA molecules encoding 
proteins associated with the T lymphocyte response. 

As set forth above, any method may be used to prepare a cDNA library from T 
lymphocytes. Preferably, the cDNA library is produced from T lymphocytes exposed 

20 to a specific pathogen, mitogen, antigen, superantigen or allergen. After exposure, the 
cDNA library is prepared by extracting the mRNA from isolated T lymphocytes, 
using known methods, for example, isolation of polyadenylated (poly A+) RNA. Kits 
for isolating poly A+ RNA are commercially available, for example, PolyATract kits 
are available from Promega Corporation. The mRNA thus extracted may be enriched 

25 for mRNAs corresponding to genes differentially expressed by hybrid selection 
procedures, and the like. In instances of a low recovery of RNA, mRNA can be 
preamplified by PCR using known methods. The cDNAs corresponding to the 
mRNAs may be prepared using a reverse transcriptase for first strand synthesis and a 
DNA polymerase for second strand synthesis. Methods for using reverse transcriptase 
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and DNA polymerase to make cDN A are well known in the art. Kits for performing 
these techniques are commercially available, for example, the Superscript II™ kit 
(Gibco-BRL), the Great Lengths cDNA Synthesis Kit™ (Clontech), the cDNA 
Synthesis Kit (Stratagene), and the like. 

5 The cDNAs may then be ligated to linker DNA sequences containing suitable 

restriction enzyme recognition sites. Such linker DNAs are commercially available, 
for example, from Promega Corporation and from New England Biolabs and the 
particular linker used may be selected to conform to the protocol being used. The 
cDNAs may be subjected to restriction enzyme digestion, size fractionation, or any 

10 other suitable method, to enrich for full-length cDNAs within the library. 

The resultant cDNA library can be modified to select for rare transcripts or for 
transcripts corresponding to genes that are differentially expressed upon exposure to a 
specific pathogen, mitogen, antigen, superantigen or allergen. For instance, when 
using a sequence corresponding to a mRNA which is up- regulated in response to 

15 exposure of a T lymphocyte population to a specific pathogen, mitogen, antigen, 

superantigen or allergen, a cDNA library is prepared from exposed T lymphocytes and 
subtractively hybridized to cDNA or mRNA (polyA+ RNA) from a quiescent T 
lymphocyte population. Subtractive hybridization methods are available, for instance 
as taught by Davis et aL, (1987) Cell 51 : 987-1000; Hedrick et aL, (1984) Nature, 

20 308: 149-153; and Sargent et aL, (1983) Science, 222: 135-139. Subtractive library 
methods that utilize PCR amplification may also be used as taught, for example, by 
Wieland et al, (1990) Proc. Natl Acad. Set USA, 87: 2720-2724; Wang et aL 9 (1991) 
Proc. Natl Acad. Set USA, 88: 11505-11509; Cecchini et aL, (1993) Nucleic Acids 
Res., 21 : 5742-5747; Lebeau et aL, (1991) Nucleic Acids Res., 19: 4778; Duguid et 

25 aL, (1990) Nucleic Acids Res., 18: 2789-2792. and U.S. Patent 5,525,471 

The resultant cDNA library may also be normalized to obtain cDNAs 
corresponding to rare or weakly expressed mRNA species. Many procedures are 
available including hybridization of the cDNA library to genomic DNA as taught by 
Weissman et aL, (1987) Mol Biol Med., 4: 133-143. Other available techniques 

30 include utilizing second order hybridization kinetics to select for rarer species as 
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taught by Ko et al., (1990) Nuc. Acids. Res., 18: 5709; Patanjali et al t (1991) Proc. 
Natl. Acad. ScL USA, 88: 1943-1947 or Soares et al, (U.S. Patent No. 5,637,685). 

The cDNA library is then screened from cDNA clones that specifically 
hybridize to a nucleic acid comprising at least one or part of one of the sequences 

5 identified in the Figures. Such methods are widely available as set forth above. 

After isolation of cDNA clones which specifically hybridize to a nucleic acid 
comprising at least one or part of one of the sequences identified in the Figures, the 
inserts into the cDNA molecules can be further characterized by known methods 
including the sequencing of the cDNA insert. In instances where the 5 ! end of the 

10 cDNA encompassing the amino terminus of an encoded protein is not contained 

within a cDNA molecule isolated by the above methods, 5'RACE PCR amplification 
and other known procedures can be used to retrieve the 5' end of the cDNA. Such 
methods are known in the art and exemplified by Fang et al, (1997) Biotechniques, 
23 (1): 52, 54, 56, 58; Chen (1996) Trends Genet, 12 (3): 87-88; Lung et al, (1996) 

15 Trends Genet, 12 (10): 389-91; Bahring et al, (1994) Biotechniques, 16 (5): 807-8; 
and Borson et al., (1992) PCR Methods Appl, 2(2): 144-8. 

Related Nucleic Acids 

As used herein, the present invention encompasses nucleic acids wherein 
sequences flanking or contiguous with at least one of the sequences identified in the 
20 Figures include: sequences remaining in the same open reading frame; sequences 
which do not include a stop codon; sequences which terminate at a stop codon; 
sequences which serve as a promoter, operator or other regulatory control sequence; 
or sequences which are derived from genomic DNA. 

Vectors and Host Cells 
25 The present invention comprises recombinant vectors containing and capable 

of replicating and directing the expression of nucleic acids comprising at least one, or 
part of one of the sequences identified in the Figures in a compatible host cell. The 
insertion of a DNA in accordance with the present invention into a vector may be 
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performed by any conventional means. Such an insertion is easily accomplished when 
both the DNA and the desired vector have been cut with the same restriction enzyme 
or enzymes, since complementary DNA termini are thereby produced. If this cannot 
be accomplished, it may be necessary to modify the cut ends that are produced by 

5 digesting back single-stranded DNA to produce blunt ends, or by achieving the same 
result by filling in the single-stranded termini with an appropriate DNA polymerase. 
In this way, blunt-end ligation with an enzyme such as T4 DNA ligase may be carried 
out. Alternatively, any site desired may be produced by ligating nucleotide sequences 
(linkers) onto the DNA termini. Such linkers may comprise specific oligonucleotide 

10 sequences that encode restriction site recognition 
sequences. 

Any available vectors and the appropriate compatible host cells may be used 
such as those disclosed by Sambrook et al.^ (1989) and Ausubel et ai 9 (1995). 
Commercially available vectors, for instance, those available from New England 
15 Biolabs Inc., Promega Corp., Stratagene Inc. or other commercial sources are 
included. 
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Specific Embodiments 



Example 1 

Production of gene expression profiles generated from cDNAs made with RNA 
isolated from quiescent T lymphocytes (Jurkat lymphocytes) and activated T 
5 lymphocytes. 

Expression profiles of RNA expression levels from Jurkat lymphocytes 
exposed to activating agents, such as 12-0-tetradecanoylphorboM3-acetate (TP A) 
and phytohemagglutinin (PHA), offer a powerful means of identifying genes that are 
specifically regulated during T lymphocyte activation. 

10 Culturine of Jurkat lymphocytes and Treatment with an Activating Agent 

Jurkat lymphocytes (Jurkat clone E6-1 ; ATCC Accession No. TIB 152) were 
grown in 6 X 100 ml RPMI culture media supplemented with 10% fetal bovine serum 
(FBS), penicillin, streptomycin and glutamine. The Jurkat lymphocytes were pelleted 
in 450 ml RPMI media supplemented with penicillin, streptomycin and glutamine 

15 only and were incubated for approximately 24 hours. After 24 hours, FBS was added 
to a final concentration of 10%. The flasks of Jurkat lymphocytes were incubated for 
1 hour. Cells were then pelleted, resuspended in 100 ml PBS, pelleted again and 
resuspended ultimately in 500 ml RPMI supplemented with 10% FBS, glutamine, 
streptomycin and penicillin. A small aliquot of the cells was removed, stained with 

20 trypan blue and counted. The concentration of cells was determined to be 2.45 x 10 6 
(or 1.225 x 10 9 total cells). From the 500 ml suspension, 250 ml was removed to 
another flask and 250 ml fresh 10% FBS RPMI was added to both flasks making 
equal aliquots of 500 ml containing ~1.0 x 10 6 cells each. For the non-stimulated 
(resting or quiescent) Jurkat lymphocytes, five 100 ml flasks were made from one of 

25 the 500 ml flasks. The remaining 500 ml flask of Jurkat lymphocytes was treated 
(stimulated) with 500 jil of 2.0 mg/ml phytohemagglutinin (PHA) and 25 jil of 1.0 
mg/ml tetraphorbol acetate (TP A), which was dissolved in ethanol. Jurkat 
lymphocytes could also be stimulated using ionomycin (Sigma) and TPA (or phorbol 
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derivatives). The 500 ml of stimulated cells were then aliquoted into five 100 ml 
tissue culture flasks. Cells were harvested 4 hours after the addition of PHA and 
TPA. The cells were pelleted at 1.5xg for at least 5 minutes and the supernatant was 
removed by pouring. Pellets were then placed on ice for the RNA to be extracted. 

5 cDNA Preparation 

Total cellular RNA was prepared from the untreated and treated Jurkat 
lymphocytes using the procedure of Newburger et al. t (1981) J. Biol. Chem. 266(24): 
16171-7 and Newburger et al t (1988) Proc. Natl. Acad. Sci. USA 85: 5215-5219. Ten 
micrograms of total RNA, the amount obtainable from about 3x1 0 6 Jurkat 

10 lymphocytes, is sufficient for a complete set of cDNA expression profiles. 

Synthesis of cDNA was performed as previously described by Prashar et aL, in 
WO 97/05286 and in Prashar et al, (1996) Proc. Natl Acad. Sci.USA 93: 659-663. 
Briefly, cDNA was synthesized according to the protocol described in the 
GIBCO/BRL kit for cDNA synthesis. The reaction mixture for first-strand synthesis 

15 included 6 ng of total RNA, and 200 ng of a mixture of 1-base anchored oligo(dT) 
primers with all three possible anchored bases as shown in the table for Figure 1 : 
(ACGTAATACGACTCACTATAGGGCGAATTGGGTCGACTTTTTTTTTTTTTT 
TTTnl wherein nl=A/C or G) (SEQ ID NO.38) along with other components for 
first-strand synthesis reaction, except reverse transcriptase. This mixture was 

20 incubated at 65°C for 5 min, chilled on ice and the process repeated. Alternatively, 
the reaction mixture may include 10 jig of total RNA, and 2 pmol of one of the 2-base 
anchored oligo(dT) primers as a heel such as RP5.0 
(CTCTCAAGGATCTTACCGCTT I8 AT) (SEQ ID NO.39), or 
RP6.0 (TAATACCGCGCCACATAGCAT ]8 CG) (SEQ ID NO.40), or RP9.2 

25 (C AGGGTAGACGACGCTACGCT, 8 GA) (SEQ ID NO. 41) along with other 
components for first-strand synthesis reaction except reverse transcriptase. This 
mixture was then layered with mineral oil and incubated at 65 °C for 7 min followed 
by 50 °C for another 7 min. At this stage, 2 jA of Superscript reverse transcriptase 
(200 units//il; GIBCO/BRL) was added quickly and mixed, and the reaction continued 



» 
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for 1 hr at 45-50 °C. Second-strand synthesis was performed at 16°C for 2 hr. At the 
end of the reaction, the cDNAs were precipitated with ethanol and the yield of cDNA 
was calculated. 

The adapter oligonucleotide sequences were 
5 Al (TAGCGTCCGGCGCAGCGACGGCCAG) (SEQ ID NO.42) and 

A2 (GATCCTGGCCGTCGGCTGTCTGTCGGCGC) (SEQ ID NO.43). One 
microgram of oligonucleotide A2 was first phosphorylated at the 5' end using T4 
polynucleotide kinase (PNK). After phosphorylation, the oligonucleotide was heat 
denatured, and ljug of the oligonucleotide Al was added along with 10* annealing 
10 buffer (1 M NaCl ; 100 mM Tris-HCl, pH 8.0; 10 mM EDTA, pH 8.0) in a final vol of 
H; 20 fA. This mixture was then heated at 65 °C for 1 0 min followed by slow cooling to 

Jj room temperature for 30 min, resulting in formation of the Y adapter at a final 

Nl concentration of 100 ng//xl. About 20 ng of the cDNA was digested with 4 units of 

fn Bgl II or another restriction enzyme in a final vol of 10 fA for 30 min at 37 °C. Two jxl 

£j 15 (-4 ng of digested cDNA) of this reaction mixture was then used for ligation to 100 
f ng (~50-fold) of the Y-shaped adapter in a final vol of 5^1 for 16 hr at 15°C. After 

M ligation, the reaction mixture was diluted with water to a final vol of 80 fA (adapter 

>; ligated cDNA concentration, 50 pg/^ul), heated at 65 °C for 10 min to denature T4 

DNA ligase, and 2 fA aliquots (with 100 pg of cDNA) were used for PCR. 
20 The following sets of primers were used for PCR amplification of the adapter 

ligated 3' -end cDNAs: 

TGAAGCCGAGACGTCGGTCG(T) , 8 nl, n2 (wherein nl, n2 = AA, AC, AG 
AT CA CC CG CT GA GC GG or GT) (SEQ ID NO. 44) as the 3' primer with Al as 
the 5' primer or alternatively, opposing primers used as 3' primers with primer Al.l 
25 serving as the 5 ' primer. To detect the PCR products on the display gel, 24 pmol of 
oligonucleotide Al or Al.l was 5' -end-labeled using 15 fA of [y- 32 P]-ATP 
(Amersham; 3,000 Ci/mmol) and PNK in a final volume of 20 fA for 30 min at 37°C. 
After heat denaturing at 65 °C for 20 min, the labeled oligonucleotide was diluted to a 
final concentration of 2 fuM in 80 fA with unlabeled oligonucleotide Al . 1 . The PCR 
30 mixture (20 fA) consisted of 2 fA (1 00 pg) of the template, 2 fA of 10* PCR buffer 
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(100 mM TrisHCl, pH 8.3; 500 mM KC1), 2 ^1 of 15 mM MgCl 2 to yield 1.5 mM 
final Mg 2+ concentration optimum in the reaction mixture, 200 //M dNTPs, 200 nM 
each 5 ' and 3 ' PCR primers (RP primers), and 1 unit of Amplitaq Gold. Primers and 
dNTPs were added after preheating the reaction mixture containing the rest of the 
5 components at 85 °C. This "hot start" PCR was done to avoid amplification of 

artifacts arising out of arbitrary annealing of PCR primers at lower temperature during 
transition from room temperature to 94 °C in the first PCR cycle. PCR consisted of 5 
cycles of 94°C for 30 sec; 55°C for 2 min, and 72°C for 60 sec, followed by 25 
cycles of 94°C for 30 sec, 60°C for 2 min, and 72°C for 60 sec. A higher number of 

10 cycles resulted in smeary gel patterns. PCR products (2.5 jA) were analyzed on 6% 
polyacrylamide sequencing gel. For double or multiple digestion following adapter 
ligation, 13.2 (A of the ligated cDNA sample was digested with a secondary restriction 
enzyme(s) in a final vol of 20 /A From this solution, 3//1 was used as template for 
PCR. This template volume of 3 jA carried 100 pg of the cDNA and 10 mM MgCl 2 

15 (from the 10* enzyme buffer), which diluted to the optimum of 1 .5 mM in the final 
PCR volume of 20 iA. Since Mg 2+ comes from the restriction enzyme buffer, it was 
not included in the reaction mixture when amplifying secondarily cut cDNA. 

Production of gene expression profiles generated using all 12 possible anchoring oligo 
d(T)nl.n2. 

20 Using the above described methods, RNA was extracted and the cDNA 

profiles prepared using all 12 possible anchoring oligo d(T)nl, n2. 

Figure 1A is an autoradiogram of the expression profiles generated from 
cDNAs made with RNA isolated from control (untreated) Jurkat lymphocytes and 
Jurkat lymphocytes incubated with TPA and PHA. 

25 Figure IB the expression profiles generated from cDNAs made with RNA 

isolated from control (untreated) Jurkat lymphocytes and activated (treated) Jurkat 
lymphocytes is an autoradiogram of activated and quiescent Jurkat lymphocytes using 
the RP 8.6 primers. 
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Such autoradiography gels may be scanned using commonly available 
equipment, such as a UMAX D-1L scanner. Bands which exhibit altered intensities in 
gene expression profiles from quiescent Jurkat lymphocytes as compared to the gene 
expression profile prepared from activated (treated) Jurkat lymphocytes can then be 
5 extracted from the display gel as previously described above. The isolated fragments 
can then be reamplified using 5 ! and 3* primers, subcloned into pCR-Script 
(Stratagene) and sequenced using an ABI automated sequencer. Alternatively, bands 
can be extracted (cored) from the display gels, PCR amplified and sequenced directly 
without subcloning. 



10 Example 2 

Production of gene expression profiles generated from cDNAs made with RNA 
isolated from T lymphocytes exposed to Staphylococci or Streptococci. 
Expression profiles from T lymphocytes (Jurkat lymphocytes) exposed to 
virulent and avirulent Staphylococcus aureus and Streptococcus would allow the 
15 identification of T lymphocyte genes that are specifically regulated in response to 
bacterial infection by these organisms, as well as in response to the superantigens, the 
bacterialenterotoxins, they contain. 

T lymphocytes are cultured as described above in Example 1 . The cells are 
then exposed to either Staphylococci or Streptococci or the enterotoxins therefrom. 
20 When exposing the T lymphocytes to an enterotoxin, the T cells can be directly 
exposed by the addition of enterotoxin to the culture medium or by addition of 
bacterial cells to the culture medium. Before incubation, bacteria are harvested and 
washed in phosphate buffered saline. T lymphocytes are then incubated with the 
bacteria at a ratio which induces activation of the T lymphocytes. For instance, a T 
25 lymphocyte:bacteria ratio of 1 :20 would be appropriate in RPMI 1640 (HEPES 
buffered) in the presence of heat inactivated FBS at 37°C with gentle mixing in a 
rotary shaker bath. 

In instances where an antigen specific response is required, bacterial antigens 
may be presented to the T lymphocytes in the context of class I or class II major 
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histocompatibility antigens (MHC). Many methods to present antigen in the context 
of MHC are readily available, including the exogenous loading of bacterial antigens 
onto antigen presenting cells, including macrophages and dendritic cells. Such 
methods include treating an antigen-presenting cell to enhance expression by the cell 
of empty major histocompatibility complex molecules, followed by reacting the 
treated antigen presenting cell with an antigen extracorporeal^ in the presence of a 
photoactivatable agent and irradiation to form an antigen-associated antigen 
presenting cell. (See, e.g., PCT application number US93/1 1220, publication number 
WO 94/1 1016). Other available methods include the production of antigen fusions to 
such molecules as the Bordetella pertussis adenylate cyclase (see U.S. Patent 
5,679,784), lysosomal/endosomal targeting sequences (see U.S. Patent 5,633,234), the 
production of empty class II heterodimers comprising antigenic peptides (see U.S. 
Patent 5,583,031), or the use of exogenously loaded immortalized dendritic cells (see 
U.S. Patent 5,648,219). 

After non-specific activation by exotoxin or antigen specific activation by a 
Staphylococcal or Streptococcal antigen, expression profiles from the activated cells 
as well as a quiescent T lymphocyte control population may be made by any means 
available in the art. Preferably, gene expression profiles are generated as described in 
Example 1 . After autoradiography or another means of detection, cDNA bands 
corresponding to mRNA species which are differentially expressed are visualized and 
the band(s) isolated or excised for further analysis as in Example 1 . 

Example 3 

Production of gene expression profiles generated from cDNAs made with RNA 
isolated from human T lymphocytes exposed to ionomycin and sn-lj2- 

dioctanoylglyceroL 

Human T lymphocytes were isolated using the method of Subramaniam et ai, 
(1988) Cell Immunol. 1 16: 439-449. The cells were treated with ionomycin and sn- 
1,2-dioctanoylglycerol (diC8) in the same media conditions as described in Example 



WO 99/57130 



-38- 



PCT/US99/09761 



y 



1 . The concentrations of ionomycine and diC8 used to activated the human T 

lymphocytes were as described in Subramaniam et al 

After incubation, RNA was extracted from the treated and untreated human T 

lymphocytes, and gene expression profiles prepared as described in Example 1 . 
5 Figure 2A is an autoradiogram of the expression profiles generated from cDNAs made 

with RNA isolated from control (untreated) Jurkat lymphocytes and human T 

lymphocytes incubated with ionomycin and diC8. 

As exhibited by Figures 2A-2B, the expression of mRNA species in human T 

lymphocytes exposed to ionomycin and diC8 (as indicated by their corresponding 
10 cDNA fragments) is modulated compared to the expression of mRNA species of 

unexposed (quiescent) human T lymphocytes. cDNA bands corresponding to mRNA 

species which are differentially expressed upon exposure to ionomycin and diC8 are 

identified by comparing the gene expression profile for the treated T lymphocytes to 

the gene expression profile from the untreated control T lymphocytes and the band(s) 
15 isolated or excised for further analysis as in Example 1. 

Example 4 

Production of gene expression profiles generated from cDNAs made with RNA 
isolated from activated human T lymphocytes exposed to cyclosporin. 
The ability to compare gene expression profiles from activated T lymphocytes 
20 to gene expression profiles prepared from activated T lymphocytes treated or exposed 
to cyclosporin or another immunoinhibitory agent allows the identification and 
isolation of mRNA species, and therefore genes, that are differentially expressed upon 
the deactivation of a T lymphocyte population. 

Human peripheral blood quiescent T lymphocytes are acquired according to 
25 the procedures set forth in Sehajpal et al, Cell. Immunol 120: 195-204 (1989) and 
Subramaniam et ai, (1988). An aliquot of the T lymphocyte population is activated 
using diC 8 and ionomycin, as described in Example 3. A second aliquot of the T 
lymphocyte population is treated with diC8, ionomycin and cyclosporin A (CsA). 
Gene expression profiles are then prepared from the diC8 and ionomycin treated cells, 
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the diC8, ionomycin and cyclosporin A treated cells and a control untreated quiescent 
T lymphocyte population according to the methods set forth in Example L The 
resulting gene expression profiles are then compared to identify cDNA bands 
corresponding the mRNA species that are differentially regulated upon exposure to 
5 cyclosporin. For instance, cDNA species corresponding to mRNA species that are 
either up-regulated or down-regulated in an activated T lymphocyte population upon 
exposure to cyclosporin are isolated or excised for further analysis, as in Example 1 . 



Example 5 

Method to identify a therapeutic or prophylactic agent that modulates the response 
10 of a proliferating T lymphocyte population. 

The methods set forth in Examples 1 and 3 offer a powerful approach for 
identifying therapeutic or prophylactic agents that modulate the activity of T 
lymphocytes or specific T lymphocyte subpopulations (e.g., T HI or T H2 ) to either 
antigen-specific or antigen-nonspecific activation. For instance, an aliquot of a T 

15 lymphocyte population that has been activated via an antigen specific or antigen- 
nonspecific pathway is exposed to the agent to be tested. Gene expression profiles are 
then prepared as set forth in Example 1 . A gene expression profile is also prepared 
from a control aliquot of the activated T lymphocyte population that has not been 
exposed to the agent to be tested as well as a quiescent T lymphocyte population. By 

20 examining for differences in the intensity of individual bands between the three gene 
expression profiles, agents which up or down regulate the expression of one or more 
mRNA species in activated T lymphocytes are identified. 

As a specific example, screening for agents which down regulate the 
expression of at least one gene associated with T lymphocyte activation, such as the 

25 human serine esterase gene, provides a means for identifying agents which may be 
useful as immunoinhibitory drugs. As set forth in the Table of Figure 4, a cDNA 
band corresponding to the human serine esterase mRNA is among a number of cDNA 
bands that are up-regulated upon T lymphocyte activation. To screen for such agents, 
an aliquot of a T lymphocyte population that has been activated via an antigen specific 
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or antigen-nonspecific pathway is exposed to the agent to be tested. Gene expression 
profiles are then prepared as set forth in Example 1. Gene expression profiles are 
also prepared from a control aliquot of the activated T lymphocyte population that has 
not been exposed to the agent to be tested as well as a quiescent T lymphocyte 
5 population. By examining for differences in the intensity of individual cDNA bands 
identified as being up-regulated in the TableS of Figures 4 or 5, such as the cDNA 
band corresponding the mRNA encoding human serine esterase, agents which down 
regulate the expression of one or more mRNA species in activated T lymphocytes are 
identified. Agents that down regulate expression of human serine esterase, as 
10 demonstrated by decreased band density in the profile produced from activated T 

p lymphocytes exposed to the agent, may be useful in modulating the activation of a T 

iK lymphocyte population. 

CP Example 6 

\j Method to identify a therapeutic or prophylactic agent that modulates the response 

^ 15 of aT lymphocyte population to a pathogen. 

M The methods set forth in Examples 1 and 3 also offer a powerful approach for 

jp identifying therapeutic or prophylactic agents that modulate the activation of T 

2 lymphocytes or specific T lymphocyte subpopulations (e.g., T Hl or T H2 ) to a pathogen 

(e.g., a virus, bacteria or fungi) or parasite. An aliquot of an isolated T lymphocyte 
20 population or subpopulation is exposed to a pathogen or parasite of interest such as 
Staphylococcus aureus, a Streptococcus species or Mycobacterium leprae. Exposure 
of the T lymphocyte population to the pathogen or parasite antigens can be facilitated 
by the presentation of pathogen or parasite antigens to the T lymphocyte population in 
the context of class I or class II MHC using commonly available methods, such as 
25 those disclosed in Example 2. A second aliquot of the same T lymphocyte 
population is then exposed to the pathogen or parasite antigens as above in the 
presence of the agent to be tested. Gene expression profiles are then prepared from 
the two aliquots, as well as from a quiescent T lymphocyte population, by the 
methods set forth in Example 1 . By examining for differences in the intensity of 
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individual bands between the three profiles, agents which up- or down-regulate genes 
of interest in the pathogen or parasite exposed T lymphocytes can be identified. Such 
agents can be used, for example, as immunostimulatory agents to up-regulate antigen 
specific T lymphocyte activation. 

5 Example 7 

Method to identify a therapeutic or prophylactic agent that modulates the activity of 
a T lymphocyte population found in a subject having a sterile inflammatory disease, 
immunodeficiency disorder, autoimmune disorder or T lymphocyte neoplasm. 
The methods set forth in Examples 1 and 3 also offer a powerful approach for 

10 identifying therapeutic or prophylactic agents that modulate the expression of T 

lymphocytes or T lymphocyte subpopulations in subjects exhibiting the symptoms of 
a sterile (non-infectious) inflammatory disease, immunodeficiency disorder, 
autoimmune disorder or T lymphocyte neoplasm (hereinafter in this example to be 
referred to as "T lymphocyte disease"). T lymphocytes from a subject exhibiting the 

15 symptoms of a T lymphocyte disease are isolated according to readily available 
methods, for instance, the methods disclosed in Subramaniam et ai 9 (1988) Cell. 
Immunol. 116:439-449. 

To test agents for their effects on T gene expression, an aliquot of the T 
lymphocyte population isolated from the subject is then treated or exposed to the 

20 agent to be tested. Gene expression profiles are then prepared from the aliquot of the 
T cell population exposed to the agent, from an aliquot of the same T lymphocyte 
population isolated from the subject and from a quiescent T lymphocyte population 
isolated from a normal subject not exhibiting the symptoms of a T lymphocyte disease 
the according to the methods set forth in Example 1. By examining these profiles for 

25 differences in the intensity of bands between the three profiles, agents which up- or 
down-regulate genes of interest in a T lymphocyte population from a subject 
exhibiting the symptoms of a T lymphocyte disease can be identified. Agents that up- 
regulate a gene or genes that are expressed at abnormally low levels in a T 
lymphocyte population from a subject exhibiting the symptoms of a T lymphocyte 
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disease compared to a normal T lymphocyte population, as well as agents that down- 
regulate a gene or genes that are expressed at abnormally high levels in a T 
lymphocyte population from a subject exhibiting the symptoms of a T lymphocyte 
disease are identified. 



Example 8 

Method of isolating agents that modulate virus infected T lymphocytes. 

The methods set forth in Examples 1 and 3 are also useful for identifying 
therapeutic or prophylactic agents that modulate the response of T lymphocytes or T 
lymphocyte subpopulations to viral infection. Many viruses demonstrate a tropism 
for T lymphocyte populations, including but not limited to HIV and HTLV I. 

T lymphocyte populations infected with a given virus are isolated from a 
subject exhibiting symptoms of viral infection. Alternatively, normal uninfected T 
lymphocyte cell lines or uninfected T lymphocytes are isolated from an uninfected 
subject and infected with the virus of interest in vitro. An aliquot of the T lymphocyte 
population infected with the virus is then exposed to the agent to be tested. 
Expression profiles are then prepared from the treated T virus infected T lymphocyte 
aliquot, an aliquot of the virally infected T lymphocyte population which has not been 
exposed to the agent, and a control population of normal quiescent T lymphocytes 
using the methods described in Examples 1 and 3. The profiles are then compared to 
identify agents that beneficially modulate the T lymphocyte response to viral 
infection. For instance, agents are identified that modulate the expression level of at 
least one gene whose expression level is down-regulated upon viral infection. 
Alternatively, agents are identified that modulate the expression level of at least one 
gene whose expression level is up-regulated in response to viral infection. 
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Example 9 

Method of isolating agents that induce differentiation of pre-T lymphocytes and 

cortical or medullary thymocytes. 
The methods set forth in Examples 1 and 3 may be used to identify therapeutic 
5 or prophylactic agents that induce differentiation of pre-T lymphocytes and cortical or 
medullary thymocytes. Pre-T lymphocytes or thymocytes are obtained using flow 
cytometry and antibodies for specific markers that are expressed on these cells. Once 
the pre-T lymphocytes or thymocytes are isolated, an aliquot of the cells are treated or 
exposed to the agent to be tested. An expression profile is then prepared from this 

10 exposed cell population, an aliquot of the same cell population which has not been 
exposed to the agent and a population of differentiated T lymphocytes. This 
population may be selected from the group consisting of cells of the following 
lineages: T H2 , T DTH , T CTL , T H2 , T s , memory T lymphocytes, effector T lymphocytes, 
pre-T lymphocytes, cortical T lymphocytes, medullary T lymphocytes, and peripheral 

15 T lymphocytes. The profiles generated from the exposed T lymphocyte population 
and the control cells are then compared to the expression profiles of the specific 
differentiated T lymphocyte subpopulations (either activated or resting T lymphocyte 
subpopulations) to identify agents that dedicate the pre-T lymphocyte or thymocyte to 
a particular T lymphocyte subpopulation {e.g., T H2 , T DTH , T CTL , T H2 , T s , memory T 

20 lymphocytes, effector T lymphocytes, pre-T lymphocytes, cortical T lymphocytes, 
medullary T lymphocytes, and peripheral T lymphocytes). 

Example 10 

Production of articles to which are attached or adsorbed selected groupings of 
nucleic acids that correspond to the genes whose expression levels are modulated in 
25 a T lymphocyte population that has been exposed to a pathogen, mitogen, 
immunogen, allergen, antigen, or superantigen. 

As set forth in Examples 1 and 3, expression profiles from T lymphocytes, for 
example, that have been exposed to a pathogen, mitogen, immunogen, allergen, 
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antigen or superantigen yield the identity of genes whose expression levels are 
modulated compared to unexposed T lymphocyte populations. 

Suitable solid supports can be prepared by those skilled in the art that 
comprise immobilized representative groupings of nucleic acids corresponding to the 
5 genes or fragments of the genes from T lymphocytes whose expression levels are 
modulated in response to exposure to a pathogen, mitogen, immunogen, superantigen 
or antigen. For instance, representative nucleic acids can be immobilized to any solid 
support to which nucleic acids can be immobilized, such as positively charged 
nitrocellulose or nylon membranes (see Sambrook etal., (1989) as well as porous 
10 glass wafers, such as those disclosed by Beattie (WO 95/1 1755). Nucleic acids are 
O immobilized to the solid support by well established techniques, including charge 
m interactions, as well as attachment of derivatized nucleic acids to silicon dioxide 
f surfaces such as glass which bears a terminal epoxide moiety. A solid support 
Jfj; comprising a representative grouping of nucleic acids can then be used in standard 
%t 1 5 hybridization assays to detect the presence or quantity of one or more specific nucleic 
y- ; acid species in a sample (such as a total cellular mRNA sample or cDNA prepared 

from said mRNA), which hybridize to the nucleic acids attached to the solid support, 
jp Any hybridization methods, reactions, conditions and/or detection means can be used, 
y* such as those disclosed by Sambrook etal., (1989), Ausbel etal., (1987), or 
20 Beattie (WO 95/1 1 755). 

One of ordinary skill in the art may determine the optimal number of genes 
and species of genes should be represented by nucleic acid fragments immobilized on 
the solid support to effectively differentiate between the gene expression profiles of, 
e.g., T lymphocytes exposed to various pathogens and T cells not exposed to the 
25 pathogens. Preferably, at least about 1, 5, 10, 20, 50, 100, 150, 200, 300, 500, 1000, 
10,000, or more preferably, substantially all of the detectable mRNA species in a cell 
sample or population will be present in the gene expression profile or array affixed to 
a solid support. More preferably, such profiles or arrays will contain a sufficient 
representative number of mRNA species whose expression levels are modulated under 
30 the relevant infection or other exposure. In most instances, a sufficient representative 
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number of such mRNA species will be at least about 1, 2, 5, 10, 15, 20, 25, 30, 40, 50, 
50-75 or 100 in number and will be represented by the nucleic acid molecules or 
fragments of nucleic acid molecules immobilized on the solid support. The skilled 
artisan will be able to optimize the number and particular nucleic acids for a given 
5 purpose, i.e., screening for modulating agents, identifying activated T lymphocytes 
and activated T lymphocyte subpopulations, etc. 

Example 11 

Production of solid support articles comprising groupings of nucleic acids that 
correspond to the genes whose expression levels are modulated in a T lymphocyte 

10 population from a subject having a sterile inflammatory disease, autoimmune 
disorder, immunodeficiency disorder or T lymphocyte neoplasm. 

The method, compositions and solid support articles set forth in Example 10 
can also be used to prepare a solid support that presents nucleic acids that correspond 
to genes whose expression levels are modulated in T lymphocytes from a subject 

15 having a T lymphocyte disease compared to normal, quiescent T lymphocytes, to 
prothymocytes, or to various subpopulations of T lymphocytes. Solid supports may 
also be prepared that comprise immobilized representative groupings of nucleic acids 
corresponding to the genes or fragments of said genes from T lymphocytes whose 
expression levels are modulated in the subject. 

20 

Example 12 

Method of diagnosing exposure of a subject to a pathogen. 
Expression profiles of RNA expression levels from T lymphocytes exposed to 
various pathogens or pathogen antigens offer a powerful means to diagnose exposure 
25 of a subject to a pathogen. As set forth in Examples 1 and 3, the display patterns 
generated from cDNAs made with RNA isolated from T lymphocytes exposed to 
pathogenic S. aureus or Streptococcus may exhibit unique patterns of cDNA species 
corresponding to T lymphocyte mRNA species (genes) whose expression levels are 
modulated in response to contact of the T lymphocytes with the bacteria or with the 
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bacterial enterotoxin. The contacting of T lymphocytes with different species of 
pathogens or pathogen antigen may result in the production of expression profiles that 
are unique to each pathogen species or strain. These unique expression profiles are 
useful in diagnosing whether a subject has been exposed to or is infected with a given 
5 pathogen. 

Briefly, expression profiles are produced as set forth in Example 1 and 3, 
using T lymphocyte samples exposed to various pathogens, such as pathogenic strains 
of Staphylococci or Streptococci. T lymphocytes are then isolated from the subject to 
be tested for exposure to a pathogen and an expression profile prepared from the 

10 subject's T lymphocytes by the methods set forth above. The expression profile 
prepared from the subject T lymphocytes can then be compared to the expression 
profiles prepared from T lymphocytes exposed to, for example pathogenic 
Staphylococci or Streptococci, and also compared to expression profiles of cells 
exposed to non-pathogenic strains or quiescent lymphocytes. From such a 

15 comparison, it can be determined which expression profile most closely matches the 
expression profile prepared from the subject, thereby, diagnosing exposure of the 
subject to a pathogen. 



Example 13 

Method of diagnosing a sterile inflammatory disease, autoimmune disorders, or 
20 immunodeficiency disorders in a subject 

Expression profiles of RNA expression levels from T lymphocytes isolated 
from a subject having a sterile inflammatory disease, autoimmune or 
immunodeficiency disorder offer a powerful means to diagnose inflammatory diseases 
(e.g., psoriasis, rheumatoid arthritis, glomerulonephritis, asthma, allergic rhinitis, 
25 cardiac and renal reperfusion injury, thrombosis, adult respiratory distress syndrome, 
inflammatory bowel diseases such as Crohn's disease and ulcerative colitis, 
periodontal disease, etc.). As set forth in Examples 1 and 3, the gene expression 
profiles generated from cDNAs made with RNA isolated from T lymphocytes from 
subjects having various sterile inflammatory diseases, autoimmune disorders or 
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immunodeficiency disorders may exhibit unique patterns of cDNA species 
corresponding to T lymphocyte mRNA species (genes), whose expression levels are 
modulated during the inflammatory process. These unique expression profiles are 
useful in diagnosing whether a subject has a sterile inflammatory disease. 

5 Briefly, expression profiles are produced as previously set forth, using T 

lymphocyte samples isolated from patients with various sterile inflammatory diseases. 
T lymphocytes are then isolated from the patient to be tested {e.g., diagnosed) and an 
expression profile prepared from the patient's T lymphocytes by the methods 
previously set forth. The expression profile prepared from the subject T lymphocytes 

10 can then be compared to the expression profiles prepared from T lymphocytes isolated 
from patients with various sterile inflammatory diseases, immunodeficiency disorders, 
or autoimmune disorders to determine which expression profile most closely matches 
the expression profile prepared from the patient, thereby, diagnosing whether the 
patient has a sterile inflammatory disease, immunodeficiency disorder or autoimmune 

15 disorder. 



Example 14 

Method of diagnosing graft versus host disease (GVHD) in a subject. 
Expression profiles of mRNA expression levels from T lymphocytes isolated 
from a subject suffering from or possibly suffering from graft versus host disease 
20 (GVHD) offer a powerful means of diagnosing GVHD prior to the expression of 
severe symptoms. The early detection of GVHD could aid in decreasing the 
morbidity associated with allografts and xenografts. 

Briefly, expression profiles are obtained from an individual who received an 
autologous graft or less preferably, from a normal person. An expression profile of T 
25 lymphocytes obtained from individuals at various stages in the course of GVHD 
would also be obtained. The two expression profiles would then be compared to the 
expression profile obtained from a patient who recently received an allograft or 
xenograft. From such a comparison, it can be determined whether the patient is 
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rejecting his or her grafted tissue. Such early detection of GVHD may save the life of 
the transplant recipient. 

It should be understood that the foregoing discussion and examples merely 
present a detailed description of certain preferred embodiments. It therefore should be 
apparent to those of ordinary skill in the art that various modifications and equivalents 
can be made without departing from the spirit and scope of the invention. All 
references, articles and patents identified above are herein incorporated by reference 
in their entirety. 



